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Abstract 

Consider the process of random transpositions on the complete graph K n . 
We use representation theory to give an exact, simple formula for the expected 
number of cycles of size k at time t, in terms of an incomplete Beta function. 
Using this we show that the expected number of cycles of size k jumps from 
to its equilibrium value, 1/k, at the time where the giant component of 
the associated random graph first exceeds k. Consequently we deduce a new 
and simple proof of Schramm's theorem on random transpositions, that giant 
cycles emerge at the same time as the giant component in the random graph. 
We also calculate the "window" for this transition and find that it is quite 
thin. Finally, we give a new proof of a result by the first author and Durrett 
that the random transposition process exhibits a certain slowdown transition. 
The proof makes use of a recent formula for the character decomposition of the 
number of cycles of a given size in a permutation, and the Frobenius formula 
for the character ratios. 



1 Introduction and main results 

Consider the complete graph K n on n vertices, and let a t be the random walk on S n 
that results when considering the interchange process on K n . That is, a t is the usual 
random transposition process (see e.g. [14]) sped up by a factor (™). 

Our first result gives an exact and surprisingly simple formula for the expected 
number of cycles of size k at time t. 
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Theorem 1. Fix any 1 < k < n and let Sk{t) be the number of cycles of size k at 
time t in a t . Then 



E(s fc (t)) 



n 



where 4>{y) = y n (1 — y) 1 and x 



-x(f)(x) + 
k 



-tk 



<fr(y)dy 



This integral is known as the incomplete beta function (the integral from to 1 
is the regular beta function). The proofs of Theorem 1 is based on representation 
theory. The key argument is a formula of Gil Alon and one of us [1] for the character 
decomposition of the number of cycles of a permutation, as well as Frobenius' formula 
for the values of the character ratios. 

Our second result uses the above formula to show that in the limit as n — > oo, 
the quantity E(sfc(t)) exhibits a sharp transition from the value to 1/k at a time 
t n ^ which is essentially (— l//c)log(l — k/n). We note that this time is an order 
magnitude smaller than the mixing time for a t which (with this parametrization) is 
(logn)/n (see [7] or [4]). The width of this transition is shown to be order 1/n 3 / 2 
when k is of order n (which corresponds to a width of order y/n in the traditional 
scaling of random transpositions). This is reminiscent of the cutoff phenomenon for 
the mixing time, see [7], [12] or [11, §18] for the cutoff phenomenon. 



Theorem 2. Let 1 < k < n and let t nk be the unique t such that e 
1). Then 



-kt 



in- 



k)/(n- 



E(s k (t)) - ^l { t>t n>k} 



< Cq exp {— c(n — k) min{|t — t n ^\ 2 k 2 , 1}} 



where q = q n (k) is a polynomial factor, q = n 3 ^ 2 k 3 ^ 2 (n — k) 1//2 . 

(Here and below c and C pertain to unspecified positive universal constants, possibly 
different from one place to another). 

At first sight it might seem as if Theorem 2 cannot possibly hold. After all, a 
large cycle of at (say of size |n) is necessarily contained in the giant component of 
a corresponding Erdos-Renyi graph G(n,t) with edge density t (see e.g. [14]). It is 
easy to check that t n ^ is the first time that the giant component has a relative size 
which exceeds 1/3, so it is clear that E(sfc(t)) must be close to before t n> k- What 
Theorem 2 says is that E(sfc(t)) suddenly reaches its equilibrium value precisely at 
that time, and does not change afterward. Note however that as t increases above 
t nj fc, the cluster continues to grow. (See Figure 1). So how come the probability for 
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Figure 1: Approximate plots of (n 2 /k)K(sk(t/n)), together with the relative size of 
the giant component, for n = 200 and k = 100. The scaling factor in front is chosen 
so that if k/n — > a then the limiting step function takes the values and a. 

a cycle of size exactly ~n does not grow with it? The answer lies in examining a 
completely random permutation. The expected number of cycles of length k in a 
random permutation of n elements is exactly l /k and hence does not depend on n, of 
course, as long as n > k. This explains why the size of the giant component does 
not affect E(sfc), as long as it is bigger than k. 

The proof of Theorem 2 follows by estimating the terms in Theorem 1 carefully. 
Let us just remark how the phase transition comes about. The integrand in Theorem 
1 (i.e. 0) has a peak at exactly the critical value t n ^. So in the subcritical case (i.e. 
t small so x large), can be estimated simply by 0(x), which is small. In the 
supercritical case (i.e. x small), the integral is estimated by writing it as — Jjf\ 
The integral from to 1 is easy to calculate, and gives the x jk term, and the integral 
from to x is the error term. 

From this we can derive a new proof of the result, first proved by Schramm [14], 
that giant cycles (of macroscopic size) emerge at time 1/n with high probability 
asymptotically as n — > oo. 

Theorem 3. Let c > 1 and let t = c/n. Let C(t) denote the length of the largest 
cycle of o~ t - Then as e — > 0, 

limliminf P(C(i) > ne) = 1. 

e— >-0 n->oo 
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Schramm's results are stronger and also give information about the joint distri- 
bution of cycles, e.g. what is the probability that the largest cycle is in [0.6n, 0.7n] 
and the second largest is in [O.ln, 0.2n\. We will not discuss the joint distribution in 
this paper. 

Theorem 3 is the mean-field case of a well-known conjecture of Balint Toth [15] 
that the cycle structure of the interchange process on the graph Z d , d > 3, exhibits 
a phase-transition with infinite cycles appearing at some finite time (see e.g. [15] for 
definitions). This paper was initially motivated by the desire to see if it would be 
possible to use a representation-theoretic approach to this conjecture. Our proof of 
Theorem 3 can be seen as the mean-field case of this programme (see also Theorem 
1 of [2] for a possible alternative route, which yields a slightly weaker conclusion). 
Let us remark that the recent resolution of Aldous' spectral gap conjecture [5] could 
aid one in an algebraic attack on the non-mean-field case. 

Interestingly, the asymptotics leading to the transition in Theorems 2 and 3 
do not come from the contribution of a single representation of S n with all other 
contributions being negligible. Rather, it is the result of some very remarkable 
cancelations when summing up over all representations of S n . These cancelations 
lead to the simple formula of Theorem 1. Here it is essential to have an exact 
computation that keeps track of all representation in order to establish the emergence 
of giant cycles of Theorem 3. Nevertheless, if one wants only weaker results, e.g. that 
whenever t > 100t n ^ we already have |E(sfc(t)) — 1 /k\ exponentially small, then in this 
case a single representation (the trivial one, [n]) gives the main contribution, with 
the contribution of all the others combined negligible. In other words, ignoring the 
cancelation between representations will cause the information about the sharpness 
of the phase transition to be lost, but one may still show the existence of an "ordered 
phase" . 

Finally, we show how our formula allows us to recover a result of Berestycki and 
Durrett [3] concerning a slowdown transition for the interchange process. Let d(t) 
denote the distance between at and o"o, i.e., the minimal number of transpositions 
needed to write at as a product of transpositions. (If one views at as a random walk 
on the Cayley graph of S n generated by the set of transpositions, then d(t) is the 
graph distance between a t and a ). 

Theorem 4. Let c > and let t = c/n. Then as n — > oo, 

I 00 l.k—2 -i 

W ))^(c):=l-£— -(cO fc . (1) 

k=l 

The asymptotic behaviour (1) was first obtained in Theorem 3 of [3]. It is known 
that for c < 1, the function u(c) simplifies to u(c) = c/2 while for c > 1, u(c) < c/2. 
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The function u is C°° everywhere except at c = 1, for which u"{l) = — oo. This 
result is indicative of a slowdown transition, in the following sense: so long as c < 1, 
the random walk escapes from the origin at maximum speed (i.e., all but a negligible 
fraction of moves take it away from the origin), while for c > 1 the random walk starts 
decelerating brutally — indeed, the acceleration is — oo at c = 1. The methods of [3] 
relied heavily on the use of an associated Erdos-Renyi random graph process, and 
more precise results were obtained (e.g., a central limit theorem for d(t)). However we 
will see that the formula (1) is obtained with essentially no effort from our Theorem 
1. 

It is interesting to note that Theorem 4 is derived by applying Theorem 1 for 
constant k, unlike Theorems 2 and 3 for which the relevant regime is k x n. It 
would be interesting to find applications of our formula in other regimes of k, for 
example k = n — o(n). 



2.1 Preliminaries 

For a permutation a, let a^cr) denote the number of cycles of size k of a, and 
Sfc(t) = afc(<7 t ). For a representation p : S n —> GL(C dimp ) of S n , let Xp be its 
character, i.e., Xp( a ) = Tr(p(cr)). Note that ay.{a) is a class function. Hence, since 
characters form an orthonormal basis of class functions (see, e.g., Proposition 1.10.2 
in [13]) there exists a p G C such that 



where the sum is over irreducible representations p. In [1] the value of a p for all 
representations p was worked out. To describe this result we need to recall some 
standard background in the representation theory of the symmetric group. The 
irreducible representations of S n are parametrized by Young diagrams of size n, that 
is, sequences A = [Ai, . . . , Xj] with Ai > • • • > Aj > 1 and £\ Aj = n, which may be 
thought of as a collection of boxes sitting on top of one another, with A« boxes in 
row i. See [13] for the details of this parameterization (we will not use it directly in 
this paper). 

As it turns out, there are very few Young diagrams for which the corresponding 
coefficient a p defined by (2) is nonzero. To state things as simply as possible, we 
restrict ourselves to the case where k < n/2 (the other case is similar and we come 



2 Proofs 




(2) 



p 
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back to this later). Consider the representation pi = [n — k, k—i, V] where V indicates 
that there are % rows with exactly one box. Then Theorem 3 of [1] states that a p 
is zero unless p = p% for some < i < k — 1, in which case a p is simply equal to 
( — iy/k, or p = [n] the trivial representation, in which case a p = 1/k. 

Lemma 1. For a representation p, let r(p) = Xpi T )/d P be the character ratio of p, 
i.e., when r is any transposition, Xp( T ) = Tr(p(r)) is the character of p at t, and d p 
is the dimension of p. Then 

n*k{t)) = x> p d p ex P { (2) ( r 0°) - (3) 

Proof. To compute sj,(t), let Q be the infinitesimal generator of the random walk. 
That is, viewing Q as an element of the group algebra C Q = Yli<j[(hj) ~ id] 
(here, denotes the transposition of i and j and id is the identity permutation). 
Thus, P(<r t = s)= e tQ (o, s) = f(s), say. Then 



= Yl a px P {s)f{s) 

P s€Sn 

p \seSn / 

= J]a p Tr/(p), (4) 



where f(p) denotes the Fourier transform of / at p. Note that f(s) = YlT=o ^r^ fc Q fc (s)- 
Since convolutions become (matrix) products after taking Fourier transform, we get 

CO 

fc=0 

where the second exp is a matrix exponential. Now note that Q(s) is constant 
on conjugacy classes, i.e., is a class function (here is where we use that we do the 
interchange process on the complete graph, and not on any other graph). It easily 
follows from Schur's lemma that Q(p) = \ p I for some X p G M, and thus we conclude 
that 

Tip) = e tXp L (5) 
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To obtain the constant \ p we proceed as follows: on the one hand we have Tr Q(p) = 
d p X p , so X p = (1/dp) TtQ(p). On the other hand, Q = J2i<j[(^j) ~ id] so by linearity 
of the trace, 

Tr Q(p) = ~ Xp(id) = (fj ( Xp (r) ~ dp), 

where xp{ T ) denotes the character of r evaluated at an arbitrary transposition (again, 
it is not important which one, since characters are class functions). Hence, dividing 
by dp, we get 



K= UJ(Kp)-i) (6) 

where r(p) = Xp( T )/d P is the character ratio at a transposition. Combining (4), (5) 
and (6), we have arrived at 

as needed. □ 



2.2 Proof of Theorem 1, case < n/2. 

We now explain how to compute the various terms in the sum involved in (3) in 
order to get to the exact formula of Theorem 1. We assume for now k < n/2. 

Proof. We start by recalling Frobenius's formula, which gives the values of the char- 
acter ratios for any representation. Suppose p is parametrized by the Young tableau 
(pi, . . . ,Pj), then (see [6, Equation (D — 2), p. 40] or [7, Lemma 7]), 

r(p) = —^—j2p*-^- 1 ^- ( 7 ) 
mil — 1) A — ' 

v ' i=l 

Thus if p = pi is the representation given by pi — [n — k, k — i, V], < i < k — 1, 
then 

n(n — l)r(pi) — (n — k) 2 — (n — k) + (k — i) 2 — 3(k — i) 
+ 1-5 + ... + 1- (2(2 + 2) -1) 
= (n - k) 2 -{n-k) + k 2 - 2ik + i 2 - 3k + 3i - {i 2 + 3z) 
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n—k 



n — k + i + 1 



n — k 



n -2k + i + 2 



n — 2k + i 



k-i-l 



> i 



Figure 2: Sizes of the hooks for the diagram [n — k, k — i, V 



(n - k) 2 - n - 2k + k' 



lik. 



(8) 



Notice the cancelation of the terms in i 2 , which is crucial for the following calcula- 
tions. 

By the hook- length formula (see e.g. (4.12) in [10]), it is also easy to compute 
d Pi , the dimension of the representation pi (see Figure 2): 



dim(pj 



n\(n -2k + i + l) 



i\k(n - k)\(k - i - l)!(n - k + i + 1) ' 



(9) 



We now plug these various expressions into (3). Recall that a p = {—l) l /k when 
p = pi, and a p = 1/k when p is the trivial representation [n]. We obtain the following 
expression, after rearranging the terms (mostly to isolate the i-independent terms 
and take them out of the sum). Let m = k — 1, and x = exp(-kt). Then 



^/ / \ \ 1 ^ \ ^ ml 
E(s k (t)) = - + C n , k J2 Tf 



n - 2k + i + 1 



i=0 



i\{m — i)\ n — k + i + 1 



-l) l x l 



(10) 



where 



a 



n,k 



1 n! f fn\ ( (n-kf -n-2k + k 2 

kW^W\^\\2 ] 



n 



n(n — 1) 

2k + k 2 ) -t 



n — n 



11^ 
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To compute the right-hand side of (10), note that 
hence 



U -1)V = (1 -x) m , 

i=U i\(rn— 1)\ v / v. / ' 



E(s fc (t)) = t + 2^ — = : — -^-^ V 



z'!(m — z)! n — k + z + l 



-xY l 



n — k + z + 1 



i=0 v 7 

-j- + C„, fc (l - x) m - C n , fc £; £ (-x 

i=0 ^ 7 



n — m + z 



- + Ai - A 2 , 



say. 



This can be simplified a little bit. Observe indeed that 

1 



i=0 ^ 7 



n — m + z 



i=0 



n— m+i 



i=0 



-1 



rz — m + i 
n - m+i ~ x dy 



y n - k (l-y) m dy 



On the other hand, A 1 = - x) m x n " m , 

expression for E(sfc(t)): with x = e~ tk , 



so we are led to the following exact 



E(s k {t)) 



k \k 



k' 



x n - m (l-x) rn - / y n ~ k (l-y) m dy 



Recalling the notation from the statement of the theorem, we can rewrite this 
shortly as 



E(s k (t)) 



k \k 



x<p(x) - / <f)(y) dy 



:i2) 



The theorem is basically finished: we only need the following simple transformation: 
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Lemma 2. 



Proof. Denote 



<t>{v) dy = t 



<fr(y) dy 



Kx) 



(p(y)dy- / <fr(y)dy. 



The integral from to 1 is just the Beta integral, so 

1 n _ fc/1 T(n-k + l)T(k) 

y v yj y T(n-k+l + k) 

For completeness, here is a short proof: Fubini's theorem shows that for every x, y > 
0, F(x)r(y) = J °° J °° e - u ~ v u x ~ 1 v y- 1 dudv, hence after a change of variables z = 
u + v, t = u/z, so that dudv = z dz dt, we get 

POD fl 

T{x)T{y)= I e~ z z x+y - l dz t x ~\l - t) y ~ l dt. 
Jo Jo 



Hence L t x 1 (l — t) v 1 dt = T(x)T(y)/T(x + y), as required. 
Multiplying by (^) gives the |, showing the lemma. 

Thus the | from lemma 2 cancels with the | in (12) and leads us to 

1 „ 



E(s k (t)) 



-x 



l-x) k ^ + I(x) 



□ 



(13) 



(with I as in the proof of lemma 2). This finishes the proof of Theorem 1 in the case 
k<n/2. □ 



2.3 Proof of Theorem 1, case k > n/2. 

The calculation in this case is very similar to the one in the previous case. In fact, 
we find the similarity eerie and needing of explanation. What we will show is that 
(10) holds, which means that all the calculation from (10) on holds as is, including 
the conclusion of the theorem. So we only need to show (10). 

Recall from [1] that if k > n/2, the representations that arise in the character 
decomposition of a k are of two types: for < i < 2k — n — 2, we have pi = 
[k — i — 1, n — k + 1, F], for which Oj = (— l) t+1 /k, while for 2k — n < i < k — 1, 
we have pi = [n — k,k — coming with a coefficient Oj = (— Vf jk. Note in 
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k-i-1 

A 



jfe-i-1 



2k — n — i 



2k — n—i — 2 



n — k + i + 1 



n — k 



Figure 3: Sizes of the hooks for the diagram [k — % — l,n — fc + 1, P 



particular, that for 2fc — n < i < k — 1, the representation and the coefficient is 
formally identical to those used in the case /c < n/2, and hence the corresponding 
dimension and character ratio are still given (formally) by the same expressions (8) 
and (9). Note also that there is no representation associated with % — 2k — n — 1. 

For < i < 2k — n — 2, consider the representation pi = [k — i — 1, n — k + 1, l l ] 
for which the coefficient is a Pi = (—l) l+1 /k. Then a calculation using Frobenius' 
character formula shows (still keeping the notation m = k — 1), 

n(n — l)r(pj) = (m — z) 2 — (m — i) + (n — m) 2 — 3(n — m) 
+ l-5 + ... + l-(2(i + 2)-l) 
— (m — i) 2 — (m — i) + (n — m) 2 — 3(n — m) — (i 2 + 3z) 
= -2%k + (n - k) 2 + k 2 - 2k - n 

Comparing with (8), observe that this is exactly the same expression. 

It remains to compute d(pi). In order to do so we use again the hook length 
formula, which gives us the following (see Figure 3): 

n\(2k — n — i — 1) 
^ = i\k{n - k)\{k - i - \)\{n - k + i + 1)' 

Formally, this is once again exactly the same result as in case k < n/2 but with a 
minus sign. But since in this case Oj = (— l) l+1 /k instead of (— l) l /k, the two minus 
signs cancel out and the formula which computes E(s fc (t)) is exactly the same, except 
for the fact that there is no term corresponding to i — 2k — n — 1. However, observe 
that the term in (10) corresponding to i = 2k — n — 1 is zero, so it does nothing to 
the sum when we keep it. 
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This shows that (10) holds also when k > n/2 and we conclude that, even when 
k > n/2, 

V 

This finishes the proof of Theorem 1. □ 



E{s k {t)) 



\x n - m (l -xf- 1 +I(x) 
k 



2.4 Proof of Theorem 2 

We need to estimate the terms appearing in Theorem 1. For this we denote B\ = 
j:(2)x n ~ k (l — x) fc ~ 1 and B 2 = (fyl{x) where I is as in the previous section. With 
these definitions K{s k {t)) = xB\ + B 2 (the reason for taking one x out of Bi is 
technical — it allows us to use the same notations for the estimate of B\ and of B 2 , 
nothing more). Another useful notation would be 

ipnM = {n-k) log(y) + {k - 1) log(l - y), 

so that the x-dependent term in B\ is and the integrand in I{x) is e^^ y \ Note 
that ip ny k has a unique maximum over [0, 1], attained at y n ^ = (n — k)/(n — 1). The 
key lemma is the following estimate: 

Lemma 3. There exists C > such that for all k < n, and for all x G [0, 1] we 
have: 

gl <^ exp / min My4 2 (n-fen e = *-iM (14) 

k y 4 J y nyk 

where q = n 3 ^ 2 k~ 3 ^ 2 (n — k)~ 1 ^ 2 , as in the statement of theorem 2. 

Proof. We start by defining 

$n,k( e ) := eX P (i'nAVnA 1 + £ ))) ( 15 ) 

and then 

$ n , fc ( £ ) = exp I {n-k) log (^—^{l + e)) +(*-!) log (l 



K n — 1 / \ n — 

= exp {(ip ntk (y n ,k) + 8} . 

To bound 8 we first assume that |e| < 1 /a and get 

5= {n-k) log(l + e) + (k - 1) log f 1 - 5^— ^ j < 



fc 
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< (n - k)e - (n - kfj + (k - l)(- s )±Ji 

< (16) 

where we have used for the first log the fact that for |e| < 1/4, log(l + e) < e — £ 2 /4, 
and for the second log simply log(l + a) < a. To remove the restriction on e, note 
that ip nt k is monotone increasing over [0, y^fc] and monotone decreasing over [y n) k, 1]- 
So we can write 5 < — |(n — k) min{|e|, 1/4} 2 , for all s. 

On the other hand, from Stirling's formula it is easy to see that there exists a 
universal constant C such that for all n and all k < (n/2), 



n )<C H n (17) 

k) " V k{n-k)k k {n-k) n - k ' y ' 



Combining (17) and (16) we get 

7l\ I Tl f Tl\ i 71 

$n,fe(e) < C< I — - exp i k log ^-J + (n - k) log I 



k) ' y k(n — k) \ \kJ \n — k 

+(n - k) log f ^— - J + (k - 1) lo! 



n — 1 J \n — 1 



= c \/*^i) ' H exp {" log fe=i) + k log (V) + '} 
< , _(.^!Vj)l (18) 



fc 3/2( n _ A .)l/2 

Multiplying further by we get the claimed bound on Si in Lemma 3. □ 
Lemma 4. When x > y n ,k{^ + £) £ > 

B^Cgexpj- ^^^ } ^ 

wMe if x < y n ,k(X + e) arad e < 0, 

1 



< C „j.l?*|Vt) ( . (20 ) 



where q = q n (k) is as in Lemma 3 and Theorem 2. 
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Proof. We analyze these two cases separately: 

Case 1: x > y n ,k(^ + s) with e > 0. Then, since ip n ^ is monotone decreasing after 
y nk , we have 



I(x) = / e My) dy < e 



and hence 



(21) 



Case 2: x < y n ,k(l + £) with e < 0. As explained in the introduction, here we will 
Lemma 2, which gives us 



\ n \ I ( x) = l-( n \ I e *»*Mdy 



k \k 



o 



and this time, since ip ntk is monotone increasing over [0, y n ,k{^ + £ )] we conclude that 



so that 

- 1A| < (22) 

Hence in both cases (21) and (22), we conclude by (18). □ 

With these two lemmas, we are now able to finish the proof of Theorem 2. 
Proof. Suppose t = t n ^ k + e, with £6l. Then 

x = e~ tk = e~ ktn ' k e~ k£ = y n ,ke~ k£ 



so 



\% yn,k\ 
Vn,k 



e~ k£ - 1| > cmin{|fce|, 1} = cmin{£;|t - t n>k \, 1}. (23) 



Thus 



E(s k (t)) - -^1^} 



xB 1 + B 2 - j_l{t>t n , k } 



By lemmas 3 and 4 < Cgexp 



min{|x - y n ,k\/yn,k, l/4} 2 (n - k) 



By (23) < Cgexp ( - cmin{fc|t - t n>k \, l} 2 (n - k)). 



As needed. 



□ 
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2.5 Proof of Theorem 3 

We now turn to the proof of Theorem 3. Let t = c/n and let X n (e) = - J2 k>n£ ksk(t) 
be the relative mass of cycles greater than ne. Then we have the following lemma: 

Lemma 5. Assume c > 1 and let 6(c) = max{z > : 1 — z > e~ zc }. Then 6(c) > 
and 

liminfE(X n (e)) > 6(c) - e. 

n— >oo 

Proof. Fix a < 6(c) and let en < k < an. It follows that 

c - 5 

tnk < for some 5 > 0. (24) 

n 

Indeed, k/n < a < 6(c) gives that 1 - k /n > e ~ (c ~ <5)(fc/n) and with 

g-ttn,* ^ f n ~ k > l_ 1 l > e -(c-8)(k/n) 

n — 1 n 

we get (24). Hence by Theorem 2, noting that q n (k) < C for any k in this range, 

H$k(t)) > j-Cexp(-S'n) 

for some C, 8' > depending on a and c. Summing up between k = \en] and 
k = [an\ , we get 

E(X n (e)) > (a - e) - Cn 2 exp(-tf'ra) 

The lemma follows immediately by letting n — >• oo, since a was arbitrarily close to 
6(c). □ 

Lemma 6. For any e > 0, on an event of probability tending to 1 as n — >• oo, 

X n (e) <6(c)+e. 

Proof. Let G(t) be the (random) graph where there is an edge if and only if the 
transposition (i,j) has occurred prior to t. Then G(t) is a realization of G(n,p) with 
p — 1 — exp(— c/n) ~ c/n. Furthermore, it is well-known and easy to see that any 
cycle of at is a subset of some connected component of G(t). By the Erdos-Renyi 
theorem (see, e.g., [8, 9]), it is known that on an event of high probability, G(t) has 
only one component greater than en (the giant component) and thus X n (e) < Y n , 
where Y n is the rescaled size of the largest component. Furthermore, the same papers 
show 

n 

in probability, where £(c) is the survival probability of a Poisson (c) Galton- Watson 
process. Hence £(c) = 6(c) and the lemma follows. □ 
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Proof of Theorem 3. Recall that the theorem states that a cycle > en exists with high 
probability. Consider the random variable E n = 6(c)+e—X n (e). Then E n is bounded 
by 2, nonnegative with high probability (from the Galton- Watson argument), and 
has expectation bounded by 2e (from Lemma 5). Hence by Markov's inequality 
IP (.En > y/z) < 3\/e for all n sufficiently large. On the complement event X n (e) > 
9(c) — \fe and hence X n (e) > 0, in which case the length of the longest cycle is > ne. 
Since e was arbitrary, Theorem 3 follows. □ 



2.6 Proof of Theorem 4 

Recall that we wish to estimate d(t), the graph distance to o~(t). Let N(t) denote 
the number of cycles of a t . It is well known and easy to check that d(t) = n — N(t). 
Let e > and fix K > 1/e. Then observe that, since there can never be more than 
ne cycles greater than K, 

1 1 K 

-E(N(t)) = -y2E(s k (t)) + 0(e). 
n n L — ' 

fc=i 

Thus Theorem 4 follows from the claim: for all fixed k > 1, and all c > 0, if t = c/n, 

±E(* fc (t)) k ^-\{ce- c f- (25) 

For this we apply Theorem 1 with these values of k and t, and note that x = e~ kt 
satisfies x — > 1, 1 — x ~ kc/n and x n ~ k — > e~ ck (the notation a ~ b is short for 
a = 6(1 + o(l)), here and below). Since m ~ n k /k\, we deduce immediately that 
(recall that (f>(x) = x n ~ k (l - x)^ 1 ), 

Moreover, it is plain that -m0(y) is uniformly bounded for x < y < 1, hence 

/ 0(2/)*/ < C(l - x) 0. 
(25) follows immediately, hence so does Theorem 4. 
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